Reverse-Safe Text Indexing

نویسندگان

چکیده

We introduce the notion of reverse-safe data structures. These are structures that prevent reconstruction they encode (i.e., cannot be easily reversed). A structure D is called z - when there exist at least datasets with same set answers as ones stored by . The main challenge to ensure stores many useful queries possible, constructed efficiently, and has size close original dataset it encodes. Given a text length n an integer , we propose algorithm constructs -reverse-safe ( -RSDS) O(n) decision counting pattern matching most d optimally, where maximal for any such -RSDS. construction takes O(nɷ log d) time, ɷ matrix multiplication exponent. show that, despite nɷ factor, our engineered implementation only few minutes finish million-letter texts. also plugging method in analysis applications gives insignificant or no utility loss. Furthermore, how technique can extended support under realistic adversary models. Finally, -RSDS queries, whose sublinear preliminary version this article appeared ALENEX 2020.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Indexing Compressed Text

As a result of the rapid growth of the volume of electronic data, text compression and indexing techniques are receiving more and more attention. These two issues are usually treated as independent problems, but approaches of combining them have recently attracted the attention of researchers. In this thesis, we review and test some of the more effective and some of the more theoretically inter...

متن کامل

Improved Dynamic Text Indexing

In the dynamic text indexing problem, a text string has to be maintained under string insertions and deletions in order to answer on-line queries about arbitrary pattern occurrences. By means of some new techniques and data structures, we achieve improved worst-case bounds. We show that nding all pocc occurrences of a pattern of length p in the current text of length n takes O(p + pocc + upd lo...

متن کامل

Kaleta SEMANTIC TEXT INDEXING

The following article presents a specific issue of semantic analysis of texts in natural language – text indexing and describes one field of its application (web browsing). The main part of this article describes a computer system assigning a set of semantic indexes (similar to keywords) to a particular text. The indexing algorithm employs a semantic dictionary to find specific words in a text ...

متن کامل

Text Indexing with Errors

In this paper we address the problem of constructing an index for a text document or a collection of documents to answer various questions about the occurrences of a pattern when allowing a constant number of errors. In particular, our index can be built to report all occurrences, all positions, or all documents where a pattern occurs in time linear in the size of the query string and the numbe...

متن کامل

Universal Compressed Text Indexing

The rise of repetitive datasets has lately generated a lot of interest in compressed self-indexes based on dictionary compression, a rich and heterogeneous family that exploits text repetitions in different ways. For each such compression scheme, several different indexing solutions have been proposed in the last two decades. To date, the fastest indexes for repetitive texts are based on the ru...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ACM Journal of Experimental Algorithms

سال: 2021

ISSN: ['1084-6654']

DOI: https://doi.org/10.1145/3461698